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ABSTRACT 

Brd4 is a double bromodomain protein that has been 
shown to interact with acetylated histones to 
regulate transcription by recruiting Positive 
Transcription Elongation Factor b to the promoter 
region. Brd4 is also involved in gene bookmarking 
during mitosis and is a therapeutic target forthe treat- 
ment of acute myeloid leukemia. The Drosophila 
melanogaster Brd4 homologue is called Fs(1)h and, 
like its vertebrate counterpart, encodes different 
isoforms. We have used ChlP-seq to examine the 
genome-wide distribution of Fs(1)h isoforms. We 
are able to distinguish the Fs(1)h-L and Fs(1)h-S 
binding profiles and discriminate between the 
genomic locations of the two isoforms. Fs(1)h-S is 
present at enhancers and promoters and its amount 
parallels transcription levels. Correlations between 
the distribution of Fs(1)h-S and various forms of 
acetylated histones H3 and H4 suggest a preference 
for binding to H3K9acS10ph. Surprisingly, Fs(1)h-L is 
located at sites in the genome where multiple insula- 
tor proteins are also present. The results suggest that 
Fs(1)h-S may be responsible for the classical role 
assigned to this protein, whereas Fs(1)h-L may have 
a new and unexpected role in chromatin architecture 
by working in conjunction with insulator proteins to 
mediate intra- or inter-chromosome interactions. 

INTRODUCTION 

The Bromo and Extra Terminal (BET) family of proteins 
has been reported to play a role in numerous functions 
including transcription regulation, chromatin boundary 
maintenance, genome structure, cell cycle regulation and 
viral replication and segregation (1,2). These proteins are 
characterized by two bromodomains capable of binding 



two acetylated lysines each, and an extra-terminus (ET) 
domain necessary for interactions with other proteins. 
Female sterile (1) homeotic [Fs(l)h] is the only BET 
family member in Drosophila melanogaster and is the 
closest orthologue to Brd4 in mammals (3-5). Because it 
is the only member of this protein family in Drosophila, 
Fs(l)h hkely carries out many of the functions of BET 
proteins described in other organisms. 

The fs(l)h gene encodes two isoforms, long [Fs(l)h-L] 
and short [Fs(l)h-S], both of which contain the tandem 
double bromodomains and the ET domain. The long 
isoform is identical to the short, but it contains an add- 
itional carboxy-terminal motif (CTM) that is essential 
for normal development of D. melanogaster (6). The best 
characterized binding partners of Brd4 via the bromo- 
domains are histones, known to become hyperacetylated 
on transcriptional activation (7). Peptide binding assays 
have shown that the first bromodomain of Brd4 has a 
much higher affinity for acetylated H4, whereas the 
second seems to recognize acetylated residues in both the 
H3 and H4 tails with similar but weaker affinity (8,9). In 
mammals, once bound to the hyperacetylated histones at 
the promoter, Brd4 recruits Positive Transcription 
Elongation Factor b (P-TEFb) to the 5' end of genes to 
release paused RNA Polymerase II (RNAPII) into pro- 
ductive elongation (10). Deletion mutants traced the inter- 
action of Brd4 with P-TEFb to the CTM domain and 
additional regions, such as the second bromodomain, 
present in both the long and short isoforms (11). In 
addition, a recent study demonstrated that recruitment 
of Brd4 to the FOSLl enhancer and promoter is 
required for P-TEFb recruitment on transcriptional acti- 
vation. The recruitment of Brd4 to enhancers is also de- 
pendent on histone acetylation (12). For a large number of 
genes, release of RNAPII from the promoter-proximal 
pause appears to be the rate-hmiting step for transcrip- 
tion. Brd4 has also been demonstrated to have a kinase 
activity capable of phosphorylating the carboxy-teniinal 
domain of RNAPII, a step required for the release of 
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the polymerase into elongation (13). Therefore, 
determining where and when Brd4/Fs(l)h is recruited 
will help our understanding of this regulatory step. 

In addition to transcription regulation, studies of Brd4 
suggest a role for this protein in global chromatin struc- 
ture. A recent study examined Brd4 depleted nuclei and 
reported disruption of chromatin architecture on a large- 
scale leading to nuclear disorganization and genome-wide 
micrococcal nuclease sensitivity. In particular, the Brd4 
long isoform was demonstrated to be responsible for this 
phenomenon (14). In support of this, work in yeast with 
the BET family Bdfl protein has shown this protein to be 
essential for maintaining heterochromatin-euchromatin 
boundaries. When Bdfl is mutated, chromatin boundaries 
are blurred, and genes on either side of the boundary are 
niisregulated, suggesting a role for this protein in the 
estabhshment or maintenance of chromatin domains 
(15). Taken together, it appears that BET proteins play 
a structural role in nuclear organization in addition to the 
classical function in transcription regulation. 

Here, we explore the role of Fs(l)h, the Brd4 homo- 
logue and single BET family member found in 
Drosophila. We present evidence, suggesting that the two 
Fs(l)h isoforms may have distinct roles in chromatin 
architecture and transcription regulation. We analyze the 
genome-wide distribution of the Fs(l)h long and short 
isoforms. The results indicate that the two isoforms have 
different binding profiles, with the short isoform binding 
at promoters and enhancers. Surprisingly, the long 
isoform is present at chromatin insulators. The distribu- 
tion of Fs(l)h-L provides an explanation for the chroma- 
tin architecture phenotypes previously observed, although 
it is possible that the two apparently distinct roles of each 
isoform are functionally related. 

MATERIALS AND METHODS 

Antibodies and Co-IP experiments 

Fs(l)h-L&S rabbit polyclonal antibody was a kind gift 
from Drs D. Huang and I. Dawid (3). Antibodies to 
Fs(l)h-L were prepared by expression of a 220 amino 
acid fragment from the carboxy-terminal domain specific 
to this isoform (see Figure 6A) in Escherichia coli, 
followed by purification and immunization of rabbits 
and guinea pigs. For immunoprecipitation experiments, 
2 X 10* Kcl67 ceUs were lysed in cell lysis buffer [5mM 
PIPES (pH 8.0), 85 mM KCl, 0.5% TX-lOO], nuclei were 
spun down, incubated in 1 ml of RIPA buffer 20 mM Tris- 
HCl (pH 7.5), 150 mM NaCl, 1 mM NazEDTA, 1 mM 
EGTA, 1% NP-40, 1% sodium deoxycholate, 2.5 mM 
sodium pyrophosphate, 1 mM b-glycerophosphate, 1 
mM Na3V04, 1 |ig/ml leupeptin containing complete, 
EDTA-free protease inhibitor cocktail tablets (Roche 04 
693 159 001) and briefly sonicated to ensure lysis. Nuclear 
lysates were treated with 5\A of DNase I (Sigma D5319) 
plus 6mM MgCl2 at 25°C for 20min to obtain single nu- 
cleosomes. Fs(l)h-L antibodies were incubated in lysate at 
4°C for 6 h and precipitated with 50/50 mixture of Protein 
A and Protein G beads (GE Healthcare 17-1279-01 and 
17-0618-01). Ah immunoprecipitated materials were 



washed four times in 1 ml of RIPA buffer and bound 
proteins were eluted in Ix Laemmli buffer and analyzed 
by western blots. For GST pull-down experiments, 1 x 10^ 
KcI67 cells were lysed in 1 ml of RIPA buffer containing 
protease inhibitors (Roche 04 693 159 001) and briefly 
sonicated to ensure lysis. Lysates were treated with 5|il 
of DNase I (Sigma D5319) plus 6mM MgCh at 25°C 
for 20min to obtain single nucleosomes. Three mico- 
hters of the Glutathione S-transferase (GST)-Fs(l)h-L 
specific peptide were incubated in lysate at 4°C for 6h 
and precipitated with 50 ^1 of glutathione-agarose beads 
(Novagen #70541). After four washes in RIPA buffer, 
bound proteins were eluted in Ix Laemnieli buffer and 
analyzed by western blots using antibodies to each of 
the insulator proteins. 

Cell culture and ChlP-seq experiments 

Kcl67 cells were grown in CCM3 serum-free insect media 
(HyClone SH30065.01) at 25°C. ChIP experiments were 
performed using 2 x 10'' KcI67 cells cross-linked in 1 % for- 
maldehyde for lOmin as previously described (16-18). 
Nuclear lysates were sonicated to generate 200-1000 bp 
DNA fragments, incubated overnight with Fs(l)h 
antibodies (3), and washed and eluted following the 
Millipore protocol for ChIP preparation. To generate 
sequencing hbraries, ChIP DNA was prepared for 
adaptor ligation by end repair (End-It DNA End Repair 
Kit, Epicenter Cat# ER0720) and addition of 'A' base to 3' 
ends (Klenow 3'-5' exo-, NEB Cat# M0212S). lUumina 
adaptors (lUumina Cat# PE- 102- 1001) were titrated ac- 
cording to prepared DNA ChIP sample concentration 
and ligated with T4 hgase (NEB Cat# M0202S). Ligated 
Chip samples were PCR-amplified using lUumina primers 
and Phusion DNA polymerase (NEB Cat# F-530L) 
and size selected for 200-300 bp by gel extraction. 
Chip hbraries were sequenced at the HudsonAlpha 
Institute for Biotechnology, using an Illumina HiSeq 2000 
sequencer. 

Data analysis 

Two biological replicates were used for ChlP-seq experi- 
ments, and only peaks found in both replicates were con- 
sidered for further analyses. Sequences were aligned to the 
Drosophila dm3 genome using Bowtie vO.12.7. Peak files 
and wiggle files were generated using MACS v 1.4.1 using a 
/"-value cutoff of 10~'° with an equivalent number of 
reads from sonicated input sequences. The relationship 
between P-values and false discovery rate (FDR) is 
shown in Supplementary Figure SI. ChlP-chip data sets 
for H3K18ac, H3K9ac, H4K8ac, H4K16ac and GAGA 
Factor (GAF) were obtained from modEncode 
(modENCODE # 2996, 3007, 3289, 318 and 2568, respect- 
ively). ChlP-seq data sets for H3K4me3, H3K4mel, 
H3K27ac, H3K27acS28ph, H3K9acS10ph (GSE36374) 
and insulator proteins (GSE30740 and GSE36944) have 
been described previously (16-18). Negative values for 
ChlP-Chip data were set to zero color (white) due to the 
fact that only positive values are obtained for ChlP-seq 
data. 
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Promoter analysis was performed using transcription 
start sites (TSS) sorted by expression level from micro- 
array data from D. melanogaster 60mer expression array 
(NimbleGen catalog no. A43 5 100 1-00-01) containing 
15 634 genes (19). Heatmaps were generated by plotting 
the total number of reads (ChlP-seq) or log base 2 ratio 
enrichment (ChlP-chip) in 20 bp bins covering 1 kb 
upstream and downstream from each site, followed by 
either clustering using Cluster 3.0 or sorting and viewed in 
Java Treeview. Histone acetylation data sets were sorted 
by the least to most correlated: H4K8ac, followed by 
H3K18ac, H4K16ac, H3K9ac and H3K27ac; the bottom 
cluster is not enriched for any of the acetylation marks 
tested. After sorting for the individual acetylation marks, 
the heatmaps for H3K9acS10ph and H3K27acS28ph were 
plotted using the anchors from the sort. 

To determine the significance of the overlap in distribu- 
tions between Fs(l)h-L, Fs(l)h-S, insulator sites and 
histone modifications, we followed the approached of 
Negre et al. (20). Briefly, we first generated permutations 
of insulator and histone modification sites by samphng N 
sites per chromosome from a random distribution of sites 
over the length of the chromosomes. We then mapped 
these simulated sites relative to those of Fs(l)h isoforms, 
and this process was repeated 10"^ times for each combin- 
ation. We calculated P-values by determining the fraction 
of simulations that produced a number of mapped 
features as extreme as that detected in the actual data. 
Using this approach, we find that pairwise interactions 
between Fs(l)h-L and insulator proteins dCTCF, 
Su(Hw), BEAF-32, CP190 and Mod(mdg4) have P- 
values of 10""*. As a control, we find that the interaction 
of Fs(l)h-S with Mod(mdg4) is not significant. 

RESULTS 

Fs(l)h is found at promoters, enhancers and other sites of 
unknown function. Brd4 has been found to play a critical 
role in transcription by releasing RNAPII from the 
promoter proximal pause. Current results suggest that 
this function is carried out by recognition of acetylated 
lysines in histones H3 and H4 tails via the tandem double 
bromodomains. To analyze the role of the Drosophila Brd4 
homologue Fs(l)h in nuclear biology, we determined its 
genome-wide distribution by ChlP-seq using an antibody 
that recognizes both protein isoforms, Fs(l)h-S and Fs(l)h- 
L (3); we will use the term Fs(l)h-L&S to indicate situations 
in which the two isoforms cannot be distinguished experi- 
mentally. We predicted we would find enrichment of 
Fs(l)h-L&S at active enhancers and promoters in light of 
previous reports describing the presence of Brd4 at the 
FOSLl enhancer and promoter in mammalian cells (12). 
To test this possibihty, we compared the distribution of 
Fs(l)h-L&S with that of H3K4me3 and H3K4mel 
obtained by ChlP-seq (16). Peaks of Fs(l)h-L&S identified 
by MACS in two biological replicates {n = 8423) were used 
as anchors to generate heatmaps of H3K4me3, which is 
correlated with active promoters, and H3K4mel, which is 
correlated with enhancers (16). Although we cannot be sure 
that all H3K4mel enriched regions act as enhancers, this 
histone modification best defines this class of regulatory 




Figure 1. Fs(l)h is found at enhancers and promoters. ChlP-seq data 
for Fs(l)h-L&S, H3K4me3 and H3K4mel ± 1 kb from each peak 
summit was K-means clustered using Cluster 3.0 and viewed in Java 
Treeview. Promoter regions contain H3K4me3 but lack H3K4mel, 
whereas enhancer regions are marked by the presence of H3K4mel 
and absence of H3K4me3. Fs(l)li-L&S is present at enhancers and 
promoters. In addition, a subset of sites indicated by question marks 
at the bottom of the heatmap is present in regions of the genome of 
unknown function. 

elements. The heatmaps were then K-means clustered 
using Cluster 3.0, and Fs(l)h-L&S was found at clusters 
enriched with H3K4mel and H3K4me3 (Figure 1). In 
addition, Fs(l)h-L&S occurs in two clusters of unknown 
identity that are not correlated with enrichment of either 
H3K4mel or H3K4me3. The cluster with the strongest 
Fs(l)h-L&S signal faUs into this category, suggesting that 
one or both Fs(l)h isoforms might have a role different 
from that previously studied in transcription activation 
(Figure 1). 

Fs(l)h-S is present at enhancers and promoters 

Studies of fs(l)h mutants in Drosophila have suggested 
different functions for the long and short Fs(l)h 
isoforms based on the different phenotypes observed in 
mutants that affect each isoform specifically (4,21). To 
further explore whether Fs(l)h-L and Fs(l)h-S perform 
distinct functions, we determined differences in their 
genomic distribution by generating an antibody specific 
to the long isoform (Figure 2A). It is not possible to 
obtain antibodies specific to the short isoform because 
the entire length of Fs(l)h-S is identical to the first 1110 
amino acids of Fs(l)h-L. ChlP-seq was then performed 
with this Fs(l)h-L specific antibody, peaks were identified 
using MACS, and those found in both biological repli- 
cates were analyzed (n = 4867). Peaks present in the 
Fs(l)h-L&S data set that are not present in the Fs(l)h-L 
ChlP-seq data set should be specific for the Fs(l)h-S 
isoform (« = 4015). This number is close to but higher 
than the expected 3556 peaks if both antibodies were com- 
pletely efficient and specific. The 11.4% discrepancy 
between expected and observed suggests that the Fs(l)h- 
L antibody recognizes some sites that are not recognized 
by the Fs(l)h-L&S antibody. It is possible that this selec- 
tion approach results in loss of Fs(l)h-S sites that also 
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contain the Fs(l)h-L isoform. Nevertheless, although not 
complete, the Fs(l)h-S data set will contain sites where 
this isoform is present without Fs(l)h-L. Cluster analysis 
shows that the Fs(l)h-S peaks are found in regions con- 
taining H3K4me3 but not H3K4mel. At these sites, the 
levels of Fs(l)h-L are close to background (Figure 2B). 
Scanning of these sites in the genome browser confirms 
differences in binding between the two data sets. Fs(l)h- 
S (sites at which Fs(l)h-L&S is present but Fs(l)h-L is 
absent) is found enriched at promoters marked by 
H3K4me3, and intergenic regions marked by H3K4mel 
and H3K27ac hkely to be enhancers (Figure 2C). From 
these data, it appears that the two isoforms have non- 
redundant distributions and therefore functions, at least 
at most sites in the genome. 
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Figure 2. The Fs(l)h-S and Fs(l)-L isoforms are present in different 
genomic locations. (A) Western blot analysis of Kcl67 cell lysates using 
Fs(l)h antibodies recognizing both the long and short isoforms (left 
panel) and long specific isoform (right panel). (B) Peak summits 
present in the Fs(l)h-L&S data set but not found in the Fs(l)h-L 
data set were used as anchors to generate heatmaps showing co-local- 
ization with H3K4me3 and H3K4mel. The number of reads is plotted 
for each ChlP-seq data set in 20 bp bins covering 1 kb upstream and 
downstream from each unique Fs(l)h-S peak summit. The heatmap 
generated was subjected to K-means clustering using Cluster 3.0 and 
viewed in Java Treeview. (C) ChlP-seq wiggle files for Fs(l)h-L&S and 
Fs(l)h-L were visualized using the SignalMap software program 
(NimbleGen) along with a track of all Dro.sophila annotated genes 
from the UCSC Genome Bioinformatics website. The panel represents 
a region spanning coordinates 815 000-855 000 of chromosome 3L 
(genes CG13900, CG42553, CG42554. CG13901, CG13887, CG13902, 
Gr61a, CG13889, CG13890, dpr20) and shows Fs(l)h-L&S binding at 
active promoters marked by H3K4me3 enrichment, and active enhan- 
cers marked by H3K4mel and H3K27ac enrichment. Fs(l)li-L is absent 
from promoters and enhancers, y-axis represents number of reads. 



The presence of Fs(l)h-S at enhancers and promoters 
correlates with active transcription 

To further investigate the role of Fs(l)h-S present in 
regions enriched in H3K4nie3 presumed to be promoters, 
TSSs from a Kc cell expression array (18,19) were sorted 
by expression level and given strand-specific orientation. 
The enrichment of Fs(l)h-L&S follows the same pattern 
as H3K4me3, a well-characterized histone modification 
correlated with transcription levels (Figure 3A). As 
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Figure 3. Sites of Fs(l)h-S correlate with gene expression and show a 
different binding profile than Fs(l)h-L. (A) TSSs were sorted by ex- 
pression level from highest to lowest and given up/downstream orien- 
tation according to strand. The total number of reads is plotted for 
each ChlP-seq data set in 20 bp bins covering 1 kb upstream and down- 
stream from each TSS and viewed in Java Treeview. (B) Peaks unique 
to Fs(l)h-S and Fs(l)h-L data sets present around TSSs were plotted in 
20 bp bins, and the average profile was plotted covering 200 bp 
upstream and downstream from the TSS and compared with the 
average profile of H3K4me3. Read counts within the window were 
normalized to the total reads within the window and plotted as a 
fold enrichment over the baseline calculated, as the average number 
of reads across the window to compare between data sets. (C) 
Example of a gene with an Fs(l)h-L&S peak shows that the Fs(l)h-L 
summit is found sHghtly upstream of the TSS, and the Fs(l)h-S summit 
is found slightly downstream of the TSS but still has enrichment 
upstream from the Fs(l)h-L cross-reactivity. (D) Enhancers identified 
by H3K4mel enrichment were used as anchors around which the total 
number of reads is plotted for each ChlP-seq data set in 20 bp bins 
covering 1 kb upstream and downstream. The enrichments were then 
sorted by Fs(l)h-L&S intensity and viewed in Java Treeview. 
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expected, the binding profile of Fs(l)h-L is different from 
that of Fs(l)h-L&S. The heatmap in Figure 3A indicates 
that some sites of Fs(l)h-L are present at TSSs, but when 
present close to promoters, the long Fs(l)h isoform 
appears to bind at a slightly different location than 
Fs(l)h-S and does not correlate as well with transcription 
levels (Figure 3 A and B). Fs(l)h-L binds on average at 
—20 bp with respect to the TSS, in what is characterized 
as the histone-free region of the promoter. Conversely, 
Fs(l)h-S is present at +140 bp with respect to the TSS at 
the same location as the peak of H3K4me3, a histone 
modifications known to occur at promoters along with 
hyperacetylated histone H3 and H4 to which Fs(l)h is 
reported to bind during transcription activation 
(Figure 3B). The location of the summit of Fs(l)h-L is 
slightly upstream from the TSS, whereas the Fs(l)h-S 
summit is slightly downstream at promoters having both 
isoforms (Figure 3C), supporting the hypothesis that the 
two isoforms perform different functions. The results also 
suggest that, if the bromodomains of the two isoforms are 
involved in their targeting to chromatin, they may do so 
by recognizing acetylated residues in different proteins. 

The initial clustering presented in Figure 1 suggests that 
there are significant numbers of Fs(l)h-L&S peaks at 
regions enriched for H3K4mel and lacking H3K4me3, a 
pattern of histone modifications characteristic of enhan- 
cers. To further explore the presence of Fs(l)h at these 
sequences, a previously published set of enhancers (16) 
was used to generate enrichment heatmaps of Fs(l)h- 
L&S, Fs(l)h-L and H3K27ac, a histone modification 
found at active enhancers. Sorting by Fs(l)h-L&S inten- 
sity shows a correlation with H3K27ac and absence of 
Fs(l)h-L, suggesting that the enrichment from the 
Fs(l)h-L&S data set is contributed specifically by the 
Fs(l)h-S isoform (Figure 3D). Although all regions with 
H3K4mel enrichment may be enhancers, the correlation 
of Fs(l)h-L&S binding with the H3K27ac modification, 
carried out by the histone acetyltransferase CEP, 
supports a role in active transcription (22). 

The Fs(l)h-S isoform is present at sites where histone H3 
is phosphoacetylated 

Current results suggest that Brd4 function is carried out 
by recognition of acetylated lysines in histones H3 and 
H4 tails via the tandem double bromodomains to recruit 
P-TEFb. This correlates well with the presence of hyper- 
acetylated histone tails described at active enhancer and 
promoter regions, whose role could be recruitment of 
Brd4 to elicit productive elongation by tethering P-TEFb 
to the promoter. However, it is not clear which acetylated 
lysine residues on the histones H3 and H4 tails are bound 
by the Brd4 bromodomains in vivo. In vitro peptide 
binding assays of each of the bromodomains have 
resulted in conflicting data, and a recent comprehensive 
analysis has shown that binding of Brd4 in vitro is not 
specific for any particular lysine residues. Instead, it 
appears that any combination of multiple acetylated 
lysines will result in Brd4 binding (8). To test whether 
this is also the case in vivo, Fs(l)h-L and Fs(l)-S specific 
peaks were used as anchors to cluster genome-wide ChIP 



data sets for H4K8ac, H4K16ac, H3K9ac, H3K18ac, 
H327ac, H3K9acS10ph and H3K27acS28ph. Sorting of 
acetylation marks from the least abundant (H4K8ac) to 
the most abundant (H3K27ac) demonstrates that Fs(l)h-S 
sites (Figure 4A) are more enriched for acetylated histones 
than Fs(l)h-L (Figure 4B). Most Fs(l)h-L sites show no 
enrichment for any of the histone acetylation marks 
investigated. A quantitative analysis of the relationship 
between the genomic locations of Fs(l)h isoforms and 
these histone modifications is presented in the scatter 
plots shown in Supplementary Figures S2 and S3. 

Fs(l)h-S sites do not appear to co-localize extensively 
with regions of enrichment for H4K8ac (Figure 4A), 
contrary to results showing strong in vitro binding of 
mammahan Brd4 to peptides containing this modification 
(8,23). Similarly, H3K18ac, a mark reported to be carried 
out by CBP at enhancers and promoters along with 
H3K27ac (24,25), showed enrichment at only a small 
fraction of Fs(l)h-S sites. H4K16ac, deposited by MOF, 
is also a requirement for Brd4 recruitment at the FOSLl 
enhancer (12). Although about half of all Fs(l)h-S peaks 
show H4K16ac enrichment, this modification is not 
present in the other half These genomic sites of Fs(l)h- 
S must therefore require some combination of acetylated 
residues other than H4K8ac and H4K16ac for recruit- 
ment. H3K27ac, a modification associated with active en- 
hancers as well as promoters, is enriched at most Fs(l)h-S 
sites but has not been shown to bind Brd4 bromodomains 
in vitro (8). Fs(l)h-S overlaps extensively with H3K9ac 
and H3K27ac, which hkely occur simultaneously on 
neighboring residues of the same histone tail. 
Interestingly, phosphoacetylated histone H3 at 
H3K9acS10ph is enriched at most Fs(l)h-S binding sites 
(Figure 4A), and it shows the highest correlation in scatter 
plots (Supplementary Figure S2). This agrees with 
previous results, suggesting that H3K9acS10ph is neces- 
sary for Brd4 recruitment at the FOSLl enhancer and 
promoter on gene activation (12). Taken together, it 
appears that no one individual acetylated histone H3 or 
H4 residue investigated here is essential for Fs(l)h-S 
binding, and it is likely that recruitment of this protein 
to chromatin is carried out by a combination of histone 
acetylation marks that may vary depending on the gene. 
On the other hand, H3K9acS10ph and H3K27acS28ph 
are the only modifications investigated here that are 
present at most Fs(l)h-S sites. As the phosphorylation 
of these two serine residues is carried out by JIL-1 in 
Drosophila (16), these results support a role for this 
kinase in recruitment of Fs(l)h to release paused 
RNAPII and elicit productive elongation. Interestingly, 
and in agreement with a different role for Fs(l)h-L, this 
isoform does not appear to be associated with acetylated 
histones H3 and H4 at most sites in the genome (Figure 4B 
and Supplementary Figure S2). 

One clear exception is H3K9acS10ph, which co-local- 
izes with both Fs(l)h-S and Fs(l)h-L. The functional sig- 
nificance of the presence of H3K9acS10ph at Fs(l)h-L 
sites is unclear at the moment and will require further 
analysis in view of the results presented later in the text. 
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Figure 4. Fs(l)h-S is preferentially enriched at sites of phosphoacetylated liistone H3. Data sets for H3K18ac, H3K9ac, H4K8ac and H4K16ac are 
from modENCODE and were obtained using ChlP-chip. All other data sets were obtained by ChlP-seq. The number of reads is plotted for each 
ChlP-seq data set indicated at the top in 50 bp bins covering 1 kb upstream and downstream from each site. Each heatmap generated was sorted by 
the lowest enrichment of histone modifications, starting with H4K8ac, followed by H3K18ac, H4K16ac, H3K9ac, and H3K27ac; the bottom cluster 
is not enriched for any of the acetylation marks tested. After sorting for the individual acetylation marks, the heatmaps for H3K9acS10ph and 
H3K27acS28ph were plotted using the same sort. Heatmaps were viewed in Java Treeview. (A) Fs(l)h-S specific peaks. (B) Fs(l)h-L specific peaks. 



Fs(l)h-L binds insulator sites with multiple insulator 
proteins 

To gain insights into the possible nature of the sequences 
where Fs(l)h is present in the unknown cluster shown in 
Figure 1, we first analyzed whether Fs(l)h-L or Fs(l)h-S is 
present at these sites (Supplementary Figure S4). Motif 
analysis was performed on the Fs(l)h-L&S peaks from 
these clusters to try to identify DNA-binding proteins 
with which Fs(l)h might associate at these genomic loca- 
tions. In Drosophila, there are five characterized insulator 
DNA-binding proteins, Drosophila CCCTC-binding factor 
(dCTCF), Suppressor of Hairy-wing [Su(Hw)], GAGA 
Factor (GAF), Boundary element-associated factor of 
32kDa (BEAF-32) and Zeste-white 5 (Zw5) (17,18,26). 
Surprisingly, Multiple EM for Motif Elicitation (MEME- 
Chip) (27) identified binding motifs for several of these 
insulator proteins, including BEAF-32, dCTCF, GAF 
and Su(Hw), suggesting a possible association of Fs(l)h 
with insulators that has not been previously reported. 
Enrichment profiles viewed in a genome browser show 
Fs(l)h-L enriched at insulator sites, particularly in loca- 
tions where multiple insulator proteins are present. 
Interestingly, the overlap of Fs(l)h-L at peaks of individual 
insulator binding proteins is less pronounced (Figure 5A 
and Supplementary Figure S5). Overall, Fs(l)h-L overlaps 
with 2373 peaks of CTCF, 3329 peaks of Su(Hw), 2650 
peaks of CP190 and 2433 peaks of Mod(mdg4). To 
explore the relationship between the genomic locations of 
Fs(l)h isoforms and insulator proteins, we performed 
K-means clustering with ChlP-seq data sets for these 
proteins using Fs(l)h-L&S sites as anchor points. The 
heatmaps shown in Figure 5B indicate that Fs(l)h-S 
specific sites (top) are depleted of insulator proteins (see 
also Supplementary Figure S6), whereas Fs(l)h-L sites 
are enriched for all insulator DNA-binding proteins with 
the exception of GAF. We have recently shown that 



insulators containing binding sites for multiple insula- 
tor proteins constitute a special subset, named 'aUgned 
insulators', which are enriched at the borders between 
topologically associating domains defined by Hi-C (17). 
Therefore, sites where Fs(l)h-L is present appear to cor- 
respond to sites of ahgned insulators (Figure 5B). We 
have previously shown that some insulator sites are 
located close to promoters, whereas results in Figure 
2A-C show that some Fs(l)h-L sites are present in the 
nucleosome-free region upstream of TSSs. To further 
explore this issue, we carried out K-means clustering 
of ChlP-seq data sets for Fs(l)h-L and insulator 
proteins using TSSs as anchors and sorted by expression 
levels of the corresponding genes. The results show an 
overlap between Fs(l)h-L and DNA-binding insulator 
proteins, with the exception of GAF, at this region 
(Figure 5C). Mod(mdg4) and CP 190 are non-DNA- 
binding insulator proteins that help mediate interactions 
between the different insulator DNA-binding proteins 
(18,28,29). To examine the relationship of the location 
of these two proteins to that of Fs(l)h-L, we performed 
K-means clustering of insulator protein ChlP-seq data 
sets, including CP190 and Mod(nidg4), using Fs(l)h-L 
peaks as anchors. A subset of Fs(l)h-L sites colocalize 
with ahgned insulators where BEAF-32, dCTCF, 
Su(Hw), CP 190 and Mod(mdg4) are present (Figure 
5D). In addition, the results show that the pattern of 
Fs(l)h-L distribution most closely resembles that of 
Mod(mdg4) (Figure 5D and E). 

The co-localization of Fs(l)h-L with insulator proteins 
does not necessarily indicate a direct interaction. To deter- 
mine whether Fs(l)h-L physically interacts with any 
of these insulator proteins in vivo, co-immunoprecipitation 
experiments were carried out with antibodies to the Fs(l)h- 
L isoform. Supporting the ChlP-seq data, interactions were 
detected between the Fs(l)h-L isoform and GAF, BEAF- 
32, Mod(mdg4) and CP190 (Figure 6A). The reciprocal 
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Figure 5. The Fs{l)h-L isoform is found at insulator sites. (A) ChlP-seq wiggle files for Fs{l)h isoforms, and insulator proteins were visualized using 
the SignalMap software program (NimbleGen) along with a track of all Drosophila annotated genes from the UCSC Genome Bioinformatics website 
in the region covering 3L: 486 000-552 000. The panel shows Fs(l)h-L present at sites containing multiple insulator proteins but not at sites where 
only one protein is found. (B) The total number of reads is plotted for each ChlP-seq data set indicated above in 20 bp bins covering 1 kb upstream 
and downstream from each Fs(l)h peak present in Fs(l)h-L&S but not in Fs(l)h-L (top), each Fs(l)h peak present in both Fs(l)h-L&S and Fs(l)h-L 
(middle) and peaks found in Fs(l)h-L but not Fs(l)h-L&S (bottom) (C) The total number of reads is plotted for each ChlP-seq data set indicated 
above in 20 bp bins covering 1 kb upstream and downstream from all annotated TSS sorted by expression level and viewed in Java Treeview. 
(D) Data sets for each of the indicated proteins were plotted in 20 bp bins covering 1 kb upstream and downstream from each Fs(l)h-L peak summit. 
The heatmap generated was subjected to K-means clustering using Cluster 3.0 and viewed in Java Treeview. (E) The Fs(l)h-L and Mod(mdg4) data 
sets from D were sorted by Fs(l)h-L intensity to show the correlation between the location and abundance of the two proteins. 
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Figure 6. Fs(l)h-L interacts with insulator proteins. (A) Co-IP experiments with a protein lysate from Kcl67 cells (Input) were performed using 
either IgG (control) or antibody against Fs(l)h-L (guinea pig). Western analysis was performed on all samples using antibodies designated on the 
right side of the panel at 1:10 volume of input to IP. (B) Co-IP experiments were performed as described earlier in the text with only one 
immunoprecipitation from the lysate using antibodies against each of the indicated insulator proteins followed by western blotting with Fs(l)h 
antibodies that recognize both isoforms. (C) Organization of the Fs(l)h locus. Amino acids 1169 through 1389 were placed in an expression vector 
with a GST tag at the N-terminus and Hisx6 tag at the C-terminus for expression in E. coli followed by purification to 1.3 ng/nl using the Hisx6 and 
GST tags. (D) The Fs(l)h-L specific peptide was incubated with a Kc cell protein lysate, precipitated with glutathione-agarose beads and analyzed by 
western blot to test for interaction with each of the insulator proteins. Results suggest that GAF, Su(Hw), CP190 and Mod(mdg4) interact, directly 
or indirectly, with the CTM domain of Fs(l)h-L. 
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co-IP experiment, using antibodies to various insulator 
proteins followed by western analysis using antibodies 
that recognize both Fs(l)h isoforms, confirms the 
previous observation, suggesting that GAF, Su(Hw), 
CP 190 and Mod(mdg4) interact with Fs(l)h-L 
(Figure 6B). One apparently contradictory result from 
these experiments is the observation of only a partial 
overlap between GAF and Fs(l)h-L in ChlP-seq experi- 
ments (Figure 5B and D), whereas the co-IP experiments 
suggests a strong interaction between GAF and Fs(l)h-L 
(Figure 6A and B). One possible explanation is that the 
interaction between these two proteins is indirect, through 
looping between sites containing GAF and sites containing 
Fs(l)h-L with other insulator proteins. Although the 
proteins are not located at the same genomic locations, 
they come down together in co-IP experiments. A co-IP 
experiment to test possible interactions between insulator 
proteins and Fs(l)h-S could not be done, as the Fs(l)h-L&S 
antibodies recognize both isoforms. However, results in 
Figure 6B suggest that antibodies to insulator proteins 
fail to pull down Fs(l)h-S, supporting the conclusion that 
this isoform does not interact with insulators. To further 
test the idea of an interaction between Fs(l)h-L and insu- 
lator proteins, a 220 amino acid GST-tagged Fs(l)h-L 
specific peptide containing the CTM domain of this 
isoform (Figure 6C) was incubated with a Kc cell protein 
lysate, precipitated with GSH-agarose beads and analyzed 
by western blot using antibodies to various insulator 
proteins. Results suggest that GAF, Su(Hw), CP190 and 
Mod(mdg4) interact, directly or indirectly, with the CTM 
domain of Fs(l)h-L (Figure 6D). Therefore, despite the fact 
that Fs(l)h-S and Fs(l)h-L have identical sequences except 
for the extra carboxy-terminal domain of Fs(l)h-L, these 
data suggest a specific function for the later at insulator 
sites, and this function is carried out by the extra 932 
amino acids found on the Fs(l)h-L isoform. 



DISCUSSION 

Regulation of transcription by restricting the release of 
promoter-proximal paused RNAPII appears to be the 
rate limiting step for the regulation of many genes. 
Release from this promoter-proximal pause requires re- 
cruitment of P-TEFb, which is carried out by Brd4 in 
mammals. Brd4 has recently been identified as a thera- 
peutic target for acute myeloid leukemia and other 
cancers (30,31), but the mechanisms by which it affects 
gene expression are not weU understood. Brd4 reminds 
bound to chromatin in mitotic chromosomes, and the 
presence of the two bromodomains suggests that it inter- 
acts with acetylated histones (7,32,33). Recent results 
show a preference of Brd4 for binding to polyacetylated 
histone H3 and H4 peptides in vitro, including K5, K8, 
K12 and K16 in histone H4 and K14 in histone H3 (8). 
The affinity of Brd4 for acetylated histone peptides in- 
creases when neighboring S and T residues are 
phosphorylated. Once tethered at the promoter by inter- 
actions with acetylated histones, Brd4 recruits P-TEFb, 
which in turns phosphorylates the Ser2 residue of 
RNAPII (10). Brd4 is also an atypical kinase that can 



interact with the carboxy-terminal domain of RNAPII 
and phosphorylate Ser2 directly (13). 

Brd4 encodes two different isoforms but, despite its 
importance as a regulator of transcription and its signifi- 
cance as a target for cancer therapeutics, the possible func- 
tional differences between the two isoforms have not been 
explored. Here, we show that Fs(l)h, the Drosopliila Brd4 
orthologue, is present at active promoters and likely plays 
a general role in transcription regulation as is the case in 
mammals. Importantly, we find that the two Fs(l)h 
isoforms display different genomic localization patterns, 
suggesting that they may play distinct roles in nuclear 
biology. We find that the Fs(I)h-S isoform is present at 
enhancers and promoters of active genes. The summits of 
Fs(l)h-S peaks at promoters are located at +140 bp from 
the TSSs and coincide with the summits of H3K4me3 
peaks, suggesting that Fs(l)h-S may interact with 
histones. This is confirmed by the strong clustering of 
Fs(l)h-S peaks and those of histones H3 and H4 
acetylated in various residues. The results suggest a non- 
uniform distribution of acetylated histones at different 
genomic sites of Fs(l)h-S, suggesting that perhaps 
Fs(l)h-S can be recruited to different genes by different 
acetylated residues. This would allow different histone 
acetyltransferase complexes with different specificities to 
participate in transcriptional regulation of different genes 
with the same mechanistic outcome. Interestingly, we find 
that Fs(l)h-S correlates best with the presence of the 
phosphoacetylated marks H3K9acS10ph and 
H3K27acS28ph. Phosphorylation of SIO and S28 in the 
tail of histone H3 is carried out by the JIL-1 kinase, and 
these two modifications are found at all active promoters 
and enhancers in Drosophila (16). JIL-1 is in turn neces- 
sary for recruitment of P-TEFb and release of the paused 
RNA polymerase (34,35). The finding of Fs(l)h-S at sites 
of phosphoacetylated H3 supports the proposed role of 
JIL-1 in the transcription process and offers additional 
mechanistic insights for how JIL-1 contributes to the 
release of RNAPII from the promoter-proximal pause. 

It is possible that recruitment of Brd4 requires acetyl- 
ation events that do not involve histone tails but may act 
combinatorially to tether this protein to the promoter. We 
have previously demonstrated that CBP, the histone 
acetyltransferase that carries out H3K27 acetylation at 
both enhancers and promoters, is required for phosphor- 
ylation of serine 2 of the C-terminus tail of RNA polymer- 
ase II (16). Studies of P-TEFb in mammalian cells have 
shown that p300 acetylates Cychn T of the P-TEFb 
complex to release it from inactivating factors (36). The 
acetylated lysine residues on Cychn T can then bind the 
bromodomain of Brd4 (37). This suggests that Brd4 may 
serve as a bridge between acetylated histone tails and other 
acetylated residues in Cyclin T to tether P-TEFb to the 
promoter. Interestingly, a subset of Fs(I)h-S sites show 
low enrichment of H3K27ac (Figure 4A), suggesting that 
either recruitment of Fs(l)h-S requires simultaneous phos- 
phorylation of the adjacent S28 residue or that acetylation 
of other residues take place at other genes. The possibihty 
of different combinations of acetylated histone residues 
being able to recruit Fs(l)h-S in Drosophila and Brd4 in 
mammals would explain the discrepancies between 
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different studies, demonstrating different histone acetyl- 
ation requirements for Brd4 recruitment to chromatin 
(12,38). 

Although the recruitment of Brd4 to the FOSLl 
enhancer has been previously shown to be required for 
recruitment of P-TEFb and productive transcription 
elongation (12), the widespread presence of Brd4 at en- 
hancers has not been reported before as a general mech- 
anism of transcriptional activation. The data presented 
here demonstrate binding of Fs(l)h-S at the majority of 
active enhancers in the Drosophila genome. Other events 
that take place at the promoter and are required for re- 
cruitment of P-TEFb and release of RNAPII, such as re- 
cruitment of JlL-1, 14-3-3 and histone acetyltransferases, 
also occur at enhancers genome wide (16). This raises the 
possibihty that binding is seeded at the enhancer and, on 
contact of the enhancer with the promoter, Fs(l)h-S can 
bring P-TEFb to the promoter to release the polymerase. 
Alternatively, Fs(l)h-S and P-TEFb may have alternative 
functions at enhancers, such as transcription of eRNAs 
required for enhancer function (39-41). A third specula- 
tive but interesting possibility is that various signahng 
pathways in the cell contribute to building a code of epi- 
genetic signatures at enhancers and promoters in the 
form of acetylation of various residues, which may then 
cooperate to recruit Brd4/Fs(l)h. The requirement for 
acetylation of histones at enhancers and promoters 
to recruit Brd4 would ensure that several different sig- 
naling events have taken place before recruitment of 
PTEF-b by Brd4 can release RNAPll into productive 
elongation. 

Contrary to Fs(l)h-S, Fs(l)-L is present in the histone- 
free region overlapping the binding sites of insulator 
proteins, suggesting that this isoform may not be recruited 
to chromatin by interactions between the bromodomains 
and acetylated histones. Rather, it is more likely that 
Fs(l)h-L recognizes acetylated residues in other proteins, 
as has been shown for the interaction between Brd4 and 
Cyclin T. Given the strong correlation between the genomic 
sites of insulator proteins and Fs(l)h-L, particularly 
between Mod(mdg4) and Fs(l)h-L, it is possible that this 
isoform is recruited to insulator sites by acetylated residues 
in insulator components. The role of Fs(l)h in the function 
of chromatin insulators is not clear at this time, but such a 
possibihty is supported by previous work in yeast, where 
Bdfl, a BET family protein, has been shown to be involved 
in the function of chromatin boundaries (15). In addition, 
depletion of Brd4 in mammalian cells results in large-scale 
chromatin unfolding. This effect requires the carboxy- 
terminal domain of Brd4, suggesting the long isoform is 
involved in this process (14). 

Although Fs(l)h/Brd4 has been regularly studied in the 
context of its role in transcription elongation, our genome- 
wide interrogation of Fs(l)h suggests that the poorly 
characterized long isoform of this protein localizes to 
sites bound by insulator proteins throughout the 
genome. This observation may suggest a role for Fs(l)h/ 
Brd4 in the establishment or maintenance of chromosome 
architecture. 
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